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ABSTRACT 


The  convergence  of  the  class  of  direct  interpolatory  iterations  I for 
a simple  zero  of  a non-linear  operator  F in  a Banach  space  of  finite  or  in- 


finite dimension  is  studied 


A general  convergence  result  is  established  and  used  to  show  that  if  F 
is  entire  the  "radius  of  convergence"  goes  to  infinity  with  n while  if  F is 


analytic  in  a ball  of  radius  R the  radius  of  convergence  increases  to  at 
least  R/2  with  n. 


1.  INTRODUCTION 


- 

I 


I 


We  study  the  convergence  of  the  class  of  direct  interpolatory  itera- 


tions I , where  I is  of  order  n and  n £ 3,  for  a simple  zero  of  a non- 
n n 


linear  operator  F in  a Banach  space  of  finite  or  infinite  dimension.  For 
n ■ 2,  see  Traub  and  Wozniakawski  [77a]. 


We  establish  a basic  convergence  theorem  for  I and  apply  it  to  two 

n 


classes  of  problems.  If  F is  an  entire  function  of  growth  order  p,  then  the 

1/P 


"radius  of  convergence"  goes  to  infinity  at  least  as  fast  as  n . We  show 
this  result  is  sharp  for  all  odd  n and  p * 1.  If  F is  analytic  in  a disk  of 
radius  R,  then  the  "radius  of  convergence"  increases  to  at  least  R/2  with  n. 
This  result  is  also  sharp  for  all  odd  n. 


The  calculation  of  the  next  iterate,  xi+j  * I^x^)  , requires  the  solution 


of  a polynomial  operator  equation  of  degree  n-1.  In  Traub  and  Wozniakowski 
[77b]  we  consider  one  way  in  which  this  can  be  done. 
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2.  INTERPOLATORY  ITERATION  I 

n 

We  consider  the  solution  of  the  non-linear  equation 

(2.1)  T(x)  - 0 

where  F:  D c -*  and  B^,  B2  are  real  or  complex  Banach  spaces  of  dimension 
N,  N ■ dim(B^)  - dim(B2),  1 SN  £ +“.  We  solve  (2.1)  by  one-point  stationary 
iterations  without  memory  in  the  sense  of  Traub  [64].  Let  x^  be  a sufficiently 
close  approximation  to  a simple  zero  a,  i.e.,  F(cr)  “ 0 and  F1  (a)  exists 
and  is  bounded.  Suppose  that  the  next  approximation  x^+^  depends  on  the 
"standard  information"  on  F,  i.e., 

(2.2)  xi+1  - w(xif  ^(x^F)) 

where  cp  is  an  iteration  operator  and 


(2.3)  ;F)  - [F (x)  ,F*  (x) F(n-1)  (x)  } for  n 2 2 


In  Traub  and  Wozniakowski  [76b]  we  showed  that  standard  Information  is 
optimal  (in  a sense  made  precise  in  that  paper)  among  all  linear  information 
sets  which  use  n evaluations.  Any  iteration  based  on  (2.3)  has  order  of  con- 
vergence no  greater  than  n.  This  was  first  proved  by  Traub  [64]  and  Rung  and 
Traub  [74]  for  scalar  problems  and  by  Wozniakowski  [74]  for  multivariate  and 
abstract  problems,  see  also  Traub  and  Wozniakowski  [76a]  who  use  a non- asymptotic 
definition  of  order  and  derive  strict  lower  and  upper  bounds  on  complexity.  In 
this  paper  we  study  the  convergence  of  a class  of  iterations  which  use  standard 
information.  The  optimal  order  is  achievable  by  an  interpolatory  iteration  I 


defined  as  follows 
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■ I 1 1 1 


(i)  construct  a polynomial  of  degree  s n-1  which  agrees  with  the 
standard  information  on  F,  i.e.. 


(2.4)  w<j)(xt)  - F(j)(xi),  j - 0,1 n-1, 


(ii)  define  the  next  approximation  x^+1  as  a zero  of  w^  WjCx^)  “ 0» 
with  a certain  criterion  of  its  choice. 


We  shall  write  x . , . ■ I (x . ;F)  . From  (2.4)  we  get 
i+1  n i 


(2.5)  wt(x)  - F(x±)  + F'(xi)(x-xi)  + ...  + ^ (x^  (x-x^”  1 


For  n a 2 we  obtain  Newton  iteration  since  the  unique  zero  of  w^  is  given  by 


xi+l  “ xi  " F' (xi^"  *F(x.).  Throughout  this  paper  we  assume  n s 3.  For  n * 2, 
see  Traub  and  Wozniakowski  [7  7a]. 


Remark  2.1 

Inverse  interpolation  can  also  be  used  in  which  case  (2.4)  is  replaced  by 


1 


where  g(x)  ■ F~(x)  is  the  inverse  operator  to  F.  Define  xi+1  ■ w^O).  The 
problem  of  solving  the  "polynomial  operator  equation"  (2.5)  is  then  eliminated 
but  the  derivatives  of  the  inverse  function  must  be  computed.  See  Traub  [64, 


Appendix  B and  Section  11.2]  for  the  case  N < • and  Brent  and  Rung  [76]  for 


We  deal  with  the  character  of  convergence  of  the  interpolatory  iteration 


Define 


4 


(J) 


(2.6)  A..  - A.(0  - supl^(tt)-1  F-:7^||,  J-2,3,... 

J J x€J 


f(j) 


whenever  FVJ  (x)  exists.  Let  q be  any  number  such  that  0 < q < 1 and  let 


{max(-^,  1)  if  N 

1 if  N 


m +oo 
< +®  , 


2 if  N - +« 

1 if  N < +“  . 


Theorem  2.1 

If  F is  n- times  differentiable  in  J,  n £ 3,  and 


(2.7) 


A„(l+q)V 


nji-1 


6n  + A2,r6  < lf 


(2.8)  xQ  € J 


then  the  polynomial  w^  has  a zero  in 


MMI*  q*T]  for  any  i.  Define 


x^+^  as  any  zero  of  w^  in  J^. 


Then 


(2.9)  lim  xt  - o,  e1+1  s q^  where 


A (l+q)n"V*“1ql(n‘l) 


11  i-*n(i«,)»-1r”-1ql<n-I)-A2,r<,t 


* q.  v. 


i* 


(2.10)  *i+1  * Cl  ne"  where 


1.«  ■ Vl+-lf)n/<1'Vi+i>,  <=!..  ■ V 


• — — 


k 


(2.11)  xl+l-»  - (-1)"  ^'1F<*'(,)(,1..)"  + od^-f). 


Proof 


Define 


(2.12)  Rj(x;y) 


J1  F(j)(yft(x-y))(x-y)j  dt 


for  x,y  € J and  j “ 2 and  n.  Assume  by  induction  that  xt  € J^.  This  holds 
for  i - 0 since  J ■ J.  From  (2.4)  we  get  the  error  formula 


(2.13)  F (x)  ■ w^x)  + Rn(x;xi), 

see  Rail  [69,  p.124].  We  want  to  show  that  w^  has  a zero  in  Jt+j.  The  equa- 
tion w^(x)  * 0 in  Ji+^  is  equivalent  to  F(x)  ■ Rn(x;xi).  Since  F(x)  ■ F'(o)(x-o)  + 
R2(x  ; rf)  we  get  the  equation 

(2.14)  x - H(x)  — a+  F' (or)_1{Rn(x;xi)  - R2(x;»)J. 

We  verify  H(Jl+1)  c j.+1>  Let  x 6 Ji+1*  Then  from  (2.6)  and  (2.12)  we  get 
IM (x) — or ||  * Anl|x-xi|P  + A2|(x-fv|f  *=  An(q1+1+ql)nTJl  + A2q2(i+1)r2  - 

. ,«*-»  ♦ a2,-,‘>  * ,l+1  r 


due  to  (2.7).  Thus  H(Jl+1)  C J^. 

If  N < +®  then  from  the  Brouwer  fixed  point  theorem  (see  e.g.,  Ortega 
and  Rheinboldt  [70,  p. 161])  there  follows  the  existence  of  the  solution  of 
(2.14),  xi+1  € Ji+j*  If  N * +®  we  shall  use  the  contraction  mapping  theorem 
(see  e.g.,  Ortegaand  Rheinboldt  [70,  p.120]).  From  (2.13)  and  (2.14)  we  get 
for  x £ J. . , 


H'(x)  - F'(o)‘1[(F'(x)-w^(x))  - (F' (x)-F' (a))]. 


From  (2.7)  we  get 


Ifcl'Cx)  ||  * nAjlK-xjr  + 2A2||X-a||  * nAjl+q)"-1  I*’1  + 2A2qr  < 1. 

Hence  H is  contractive  on  and  there  exists  a unique  solution  of 

(2.14)  in  J1+^  for  any  i. 

Now  let  N ^ +«°.  Setting  x “ xi+l  in  (2.14)  we  get 

®i+l  * AJK+r*jP  + A2®i+1  5 An(1+q)  ^ q(  ^ (ei+l+ei)  + r®i+l* 


Thus  e^+^  £ where  ^ is  given  in  (2.9)  and  s q due  to  (2.7).  Hence 

lim  e ■ 0.  Returning  to  (2.14)  we  have 
i 4 


e . , , SC.  e . where  C 
i+1  i,n  i 


l.n  ' 


Since  C.  _ is  bounded,  e,,./e  5 C,  e”  * ■*  0.  Hence  lim  C.  =*  A which 

i,n  i+r  i i,n  i ^ i,n  n 

proves  (2 . 10) . Next  observe  that 

Rj(xi+l;xi>  " ^ + od^-alf1). 


Thus  (2.14)  for  x ■ xi+l  8ives 


xi+1-a  - (-l)n  F'(o)"1  - n,-^a-^  (xt-a)n  + o(||Kt+1-a||  + l^-alf1) 


which  proves  (2.11)  and  theorem  2.1. 


For  N ■ +®  we  proved  that  there  exists  a unique  zero  of  w,  in  J.  for  any 


i.  For  the  multivariate  case,  N < +®,  this  need  not  be  true.  The  next  ap 


proximation  x^+1  can  be  defined  as  any  zero  of  wi  in  J^.  However,  it  is  easy 
to  show  that  for  large  i the  polynomial  w.  has  a unique  zero  in  J..  I 


The  computation  of  x^+^  requires  the  solution  of  the  "polynomial  operator 
equation"  w^(x)  =•  0.  There  are  a number  of  ways  for  dealing  with  the  problem 
of  solving  this  equation  numerically.  One  is  to  apply  a number  of  Newton 


iterations  (say  k)  starting  with  xt  and  taking  the  kth  Newton  iterate  as  xi+1 
Traub  and  Wozniakowski  [77b]  study  the  convergence  and  complexity  of  such 


"Interpolatory-Newton"  iterations 


If  we  additionally  assume  that  F is  (n+l)-times  differentiable  in  J then 
using  the  same  technique  it  may  be  shown  that 


where  ||6.  ||  * c||  x 


In  theorem  2.1  we  assume  that  xQ  € J.  Note  that  the  radius  of  J depends 

on  F and  q.  For  a given  problem  F one  can  ask  what  is  the  optimal  value  of  q 

% 

which  maximizes  r . in  the  next  section  we  deal  with  that  problem  for  any 


value  of  n 
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3.  BALL  OF  CONVERGENCE  OF  I 

n 

In  the  previous  section  we  proved  that  if  F is  sufficiently  regular  and 

if  an  initial  approximation  x^  belongs  to  the  ball  J then  the  interpolatory 

iteration  I converges.  Of  course  we  would  like  to  have  J as  large  as  possible, 
n 

To  make  this  idea  more  precise  we  define  a ball  of  convergence  as  follows. 
Definition  3.1 

A ball  Jn  ■ Jn(F)  " fr:  ||x-«|l  < rn5  where  =*  ^(F)  is  said  to  be  a ball 

of  convergence  or  the  interpolatory  iteration  I for  a function  F if  for  any 

x € J the  sequence  x.  , 3 I (x. ;F)  converges  to  the  solution  et  and  for  every 
0 n i+l  n i 

e > 0,  there  exists  x.  such  that  |lxn-o||  ^ T + e and  x. . - = I (x.  ;F)  is  not 

0 Ir0  11  n i+l  n i 

* 

convergent  to  a.  is  called  the  radius  of  the  ball  of  convergence  or  the 
radius  of  convergence  for  short.  ■ 

Of  course  the  ball  of  convergence  can  be  defined  for  any  iteration  # in 

the  same  way.  In  fact  it  is  sometimes  more  convenient  to  have  the  concept  of 

domain  of  convergence  D where  a € D and  starting  from  any  point  xQ  € D the 

sequence  x^+^  ■ $(x^;F)  converges  to  a.  In  general  it  is  very  hard  to  find  D 

and  therefore  we  restrict  ourselves  only  to  the  case  when  D is  a ball  with 

center  at  a;  compare  Ortega  and  Rheinboldt  [70,  p.236]. 

Note  that  if  F is  a polynomial  of  degree  in-1  then  the  interpolatory  poly- 

* 

nomial  wt  3 F and  we  get  convergence  for  any  xQ.  This  means  ■ +“  and 

J (F)  ■ B . To  exclude  this  exceptional  case  we  shall  assume  throughout  this 
n v 1 

section  that  F is  not  a polynomial. 

Recall  that  F:  D -*  D c Bj  and 


. --.j. 


(3.1)  A = A (r)  3 sup  |^'(Qf) 
n n - _ . % 

x€J  (r) 


.^-1  III M 


|,  n 3 2,3,... 


for  r such  that  J(r)  = {x:  ||x-a||  < r)  c:  d. 


Define  a function  g(r)  3 g(r,q)  as 


(3.2)  g(r) 


An(r) ( 1+q) nrn” 1 


*n  + A2(r)qr6 


where  and  6 are  defined  in  Section  2.  Note  that  g is  strictly  increasing  and 


Je 

g(0)  = 0,  g(+°°)  3 +®.  Thus  there  exists  a unique  r 3 r (q)  > 0 such  that 


g(r  ) = 1.  Let 


Je  ~k 

(3.3)  r 3 max  r (q) . 
n 0<q<l 


Applying  theorem  2.1  with  the  q which  maximizes  r we  can  define  the  radius 


f in  (2.7)  as  r = rn“e  ^or  any  sufficiently  small  positive  e.  From  theorem  2.1 


the  radius  F of  the  ball  of  convergence  J (F)  can  be  bounded  below  by 
n n 


4 

(3.4)  r Sr. 


* * 

We  shall  see  that  for  some  problems  F — r for  large  n which  indicates  that 


theorem  2.1  is  asymptotically  sharp.  Furthermore  we  shall  show  that  for  some 


problems  rn  increases  with  n or  even  tends  to  infinity  with  n. 


We  consider  two  cases  depending  on  the  domain  of  F as  follows: 


(i)  F is  an  entire  function,  i.e.,  D 3 B^  and  B^,  B2  are  Banach  spaces 


over  the  complex  or  real  field. 


(il)  F is  an  analytic  function  in  a finite  domain  D 3 {x:  |(x-or||  < r] 


where  0 < R < +“. 


Case  (i).  Let  F be  an  entire  function.  Thus 
00 

(3.5)  F (x)  - 2,  F(1)  («)(*-«> l» 

i-1 

Definition  3.2 

We  say  F has  the  growth  order  p.  0 < p < +“  and  the  type  t.  t > 0.  of  its 
order  if  1 

for  a constant  M and  i - 0,1 • 


Compare  with  the  definition  for  the  scalar  case  in  Hille  [62].  For  the 
sake  of  simplicity  we  do  not  consider  the  growth  order  p “ 0 or  p * +<*>.  However 
it  is  possible  to  analyze  such  cases  as  well. 

Define 


.6)  f(z)  - z*.  z € (L- 


(3 


i-1 


Then 


(3.7)  |^(J)(x)  ||  * fU,(||ic-a||> 


<J) 


for  x € Bj  and  j - 0,1,., 


We  need  bounds  on  the  growth  of  f ^ . From  Hille  [62,  p.183]  follows  that 


there  exists  a constant  c^  > 0 such  that 


(3.8)  max^  | £ (*) | - f(r)  * CjT^exp^r**) , Vr  > 0. 


Lenana  3.1 


Let  " max(0,  1-l/p).  Then 


(3.9)  f , ft?.  Jl-J  ).  f (2a 
n • > \ v P 


f(2“r). 


Proof 


Note  that 


f ( } (r)  » MC  c -^SEi- 

i,n  a 

i-0 


where  v 


1/p,  C - tV  and  ci>n  - [(n^)n']  . 


It  may  be  shown  that 


n'.  * (J^1)  n\  * 2 0+1  ni,  Vh.i. 


If  p < 1 then  c.  £ (nl)  and 
l,n 


f(n)  (r)  ^ C" 


(ni)1/p 


which  proves  (3.9)  with  ^ ■ 0.  Assume  p s 1.  Then  g,  ■ 1-v  > 0 and 


c,  s [ni  2n+i)u.  Hence 
lfn 


”•  (n'.)1/p 


which  proves  (3.9). 


We  are  ready  to  prove  the  follow*  r theorem 


12 


Theorem  3.1 

If  F has  the  growth  order  p and  the  type  t of  Its  order  and  if 

l±£ll 


(3.10)  q - qn  ■ t(c2n)  p exp(2UPTc2n/p) 


(3.10) 

q " % 

where  c 

2 and  c. 

(3.11) 

c22wt 

and  |ji. 

6 and  c 

(3.12) 

r“rn 

Proof 

.11)  c22lipT  exp (2c22*apt+1)  < 1,  c3  < 2p  ^(c^t2^) 


and  n,  6 and  are  defined  as  above  then  theorem  2.1  holds  with 

V P/ix„/n  \ u. 


We  want  to  estimate  g defined  by  (3.2).  Since  ■ o(n),  8n  ■ 1 in  (2.7)  and 
(3.2)  for  large  n.  After  some  algebraic  manipulations  we  get  from  Lemma  2.2 


g((c2n)p)  - o(tc22^T  exp(2c22tAPT+l)  }P  "^n'T/p  ) + * 'c" 


13 


By  Stirling's  formula 


n'.e”  - (l+o(l)) 


and  due  to  (3.11),  g((c2n)^p)  < 1 for  large  n.  This  means  that  (2.7)  holds 

for  f ■ f ■ (c_n)  1//p( l+o(l))  for  every  n.  ■ 

n £ 

Theorem  3.1  states  a "type  of  global  convergence"  of  the  interpolatory  itera- 
tion I . The  iteration  In  is  convergent  for  fn  - (c2n)ly^p  which  tends  to  infinity 
with  n and  the  growth  of  the  radius  depends  on  the  growth  order  p. 


gUtegwwi 


13- 


From  (3.3)  and  (3.4)  we  get 


Corollary  3.1 

* 

If  F has  the  growth  order  p then  the  radius  r of  the  ball  of  conver- 
gence of  tne  interpolatory  iteration  In  satisfies 

(3.13)  T*  s (c2n)l/p (l+o(l)),  Vh. 

We  want  to  prove  that  (3.13)  is  sharp  for  n odd  and  p ■ 1. 


Theorem  3.2 

There  exists  a problem  F of  growth  order  p ■ 1 for  which  (3.13)  is  sharp 
for  all  odd  n. 


Proof 


(3.14)  F (x)  - e -a,  a > 0, 


for  real  x.  The  growth  order  p and  the  type  r are  now  equal  to  unity.  From 


Corollary  (3.1)  we  get 


r (F)  * c,n( l+o(l)) 
n z 


where  c^  exp(2c2+l)  < 1 which  means  c^  < 0.23.  We  shall  show  that 


(3.15)  rn(F)  Sc4n(l+o(l)) 


for  n odd  where  c^  exp(l+c^)  > 1 which  means  c^  > 0.28. 

Let  w be  the  interpolatory  polynomial  of  degree  s n-1  such  that 
w^W)  - F^(xn)  for  j - 0,1 n-1.  Then 


w(x)  - e °Sn-1(x-x0)  - a 


I 
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where 


see  Newman  and  Rivlin  [72].  From  this  it  follows  that  S2k  does  not  have  real 
zeros  and  S2k_1  does  have  a unique  real  zero  which  we  label  by  z2k-l*  111  is 


known  that 


z2k-l  " ' c5(2k-l)(l+o(l)),  Vk, 


where  c5  exp(l+c5>  ■ 1,  c5  4 0.28,  see  Szego  [24]  and  Rosenblum  [57],  Observe 
that  S^k(x)  * S2k-l*x^  which  8ives 

21c 

Z2k-1 

min  S2k(x)  - S2k(z2k.1)  - • 

x 

We  want  to  find  such  that  the  polynomial  w does  not  have  real  zeros  which 
means  that  the  interpolatory  iteration  is  not  defined  at  x^  and  ^ • 

The  equation  w(x)  ■ 0 is  equivalent  to 


Sn-!(X'X0)  ’ ae  * 


Let  Xg  - c^n.  Then 


0.16) 


Note  that  1 — - c.e(l+o(l))  and  Jae  4 ■ e (l+o(l)).  Since 

n‘V(n-l)'.  5 

c^e  - exp(-Cj)  and  c5  < c^  then  (3.16)  is  always  positive  for  large  n.  Hence 


(3.15)  and  theorem  3.2  are  proven.  ■ 

However,  it  may  be  shown  that  r*(F)  • +®  for  n even  for  the  problem  (3.14). 
The  sharpness  of  (3.13)  for  n even  or  p I4  1 is  open. 


From  (3.17)  we  get 


for  large  n.  This  means  that  (2.7)  holds  for  r • Tn  » -y-(R-eXl+o(l)) 

for  all  n.  ■ 


Remark  3.1 

It  Is  possible  to  get  a slightly  sharper  estimate  of  Fr  in  (3.13).  It 
My  be  shown  that 


r 

n 


In (nCg) 
2n 


(l+o(l)) 


) 


where  * c6(M,C)  is  a positive  constant. 


Since  « can  be  arbitrarily  small,  theorem  3.3  states  that  Fr  — ^ R for 

large  n.  This  means  that  the  radius  r is  about  one  half  of  the  domain  radius 

n 

R.  Once  more  this  gives  a "type  of  global  convergence".  From  (3.4)  we  get 
Corollary  3.2 

“ft 

If  F satisfies  (3.17)  then  the  radius  F of  the  ball  of  convergence  of 

n 

the  interpolatory  iteration  I satisfies 

(3.21)  F*  2^R(l+o(l)),  *h.  ■ 

n i 

We  now  show  that,  in  general,  (3.21)  is  sharp  for  n odd. 

Theorem  3.4 

There  exists  a problem  F for  which  (3.21)  is  sharp  for  all  odd  n. 

Proof 

1 

1-f.v 


Let 

(3.22)  F (x) 


0 < C < 1 


We  shall  show  that 


The  interpolatory  polynomial  w is  given  by 


The  equation  w(x)  * 0 is  equivalent  to 


It  is  straightforward  to  verify  that  (3.24)  does  not  have  zeros  in  [-R,R] 
This  implies  xn  ^ J (F)  and 


However,  it  may  be  shown  that  f (F)  ■ R for  n even.  Thus  the  sharpness  o£ 


(3.21)  is  open  for  n even 
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